8 research outputs found

    A Path Relinking Method for the Joint Online Scheduling and Capacity Allocation of DL Training Workloads in GPU as a Service Systems

    Get PDF
    The Deep Learning (DL) paradigm gained remarkable popularity in recent years. DL models are used to tackle increasingly complex problems, making the training process require considerable computational power. The parallel computing capabilities offered by modern GPUs partially fulfill this need, but the high costs related to GPU as a Service solutions in the cloud call for efficient capacity planning and job scheduling algorithms to reduce operational costs via resource sharing. In this work, we jointly address the online capacity planning and job scheduling problems from the perspective of cloud end-users. We present a Mixed Integer Linear Programming (MILP) formulation, and a path relinking-based method aiming at optimizing operational costs by (i) rightsizing Virtual Machine (VM) capacity at each node, (ii) partitioning the set of GPUs among multiple concurrent jobs on the same VM, and (iii) determining a due-date-aware job schedule. An extensive experimental campaign attests the effectiveness of the proposed approach in practical scenarios: costs savings up to 97% are attained compared with first-principle methods based on, e.g., Earliest Deadline First, cost reductions up to 20% are obtained with respect to a previously proposed Hierarchical Method and up to 95% against a dynamic programming-based method from the literature. Scalability analyses show that systems with up to 100 nodes and 450 concurrent jobs can be managed in less than 7 seconds. The validation in a prototype cloud environment shows a deviation below 5% between real and predicted costs

    W_SR: A QoS Based Ranking Approach for Cloud Computing Service

    Get PDF
    Cloud computing is a kind of computing model that promise accessing to information resources in request time and subscription basis. In this environment, there are different type of userĂ¢â‚¬â„¢s application with different requirements. In addition, there are different cloud Service providers which present spate services with various qualitative traits. Therefore determining the best cloud computing service for users with specific applications is a serious problem. Service ranking system compares the different services based on quality of services (QoS), in order to select the most appropriate service. In this paper, we propose a W_SR (Weight Service Rank) approach for cloud service ranking that uses from QoS features. Comprehensive experiments are conducted employing real-world QoS dataset, including more than 2500 web services over the world. The experimental results show that execution time of our approach is less than other approaches and it is more flexible and scalable than the others with increase in services or users

    Studying Risk Effect in Evaluating the Financing Methods of New Technology-Based Firms

    No full text
    Innovation & Prosperity Fund (IPfund) in Iran as a governmental organization aims to develop new technology-based firms (NTBF) by its available resources through financing these firms. The innovative projects which refer to IPfund for financing are in a stage which can receive both fixed rate facilities and partnership in the projects, i.e. profit loss sharing (PLS). Since this fund must protect its initial and real value of its capital against inflation rate, therefore, this study aims to examine the suitable financing methods with considering risk. For this purpose we study on risk assessment models to see how to use risk adjusted net present value for knowledge based projects. On this basis, the NPV of a project has been analyzed by taking into account the risk variables (sales revenue and the cost of fixed investment) and using Monte Carlo simulation. The results indicate that in most cases for a project, the risk adjusted NPV in partnership scenario is more than the other scenario. In addition to, partnership in projects which demand for industrial production facilities is preferable for the IPfund than projects calling for working capital
    corecore